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Abstract: In this work we study preference systems natural for the Peer-to-Peer paradigm. Most of them 
fall in three categories: global, symmetric and complementary. All these systems share an acyclicity property. 
As a consequence, they admit a stable (or Pareto efficient) configuration, where no participant can collaborate 
with better partners than their current ones. 

We analyze the representation of the such preference systems and show that any acyclic system can be 
represented with a symmetric mark matrix. This gives a method to merge acyclic preference systems and 
retain the acyclicity. We also consider such properties of the corresponding collaboration graph, as clustering 
coefficient and diameter. In particular, studying the example of preferences based on real latency measurements, 
we observe that its stable configuration is a small-world graph. 

Key-words: P2P, stable marriage theory, rational choice theory, collaborative systems, BitTorrent, overlay 
network, matchings, graph theory 
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Systemes de preferences acycliques dans les reseaux pair-a-pair 



Resume : Cet article est consacre a des systemes de preferences naturels dans les reseaux pair-a-pair. La 
plupart de ces systemes appartiennent a Tune des trois classes de preferences suivantes : globale, symetrique et 
complementaire. Ces preferences ont la particularite d'etre acycliques. En consequence, ils possedent une unique 
configuration stable (au sens de Pareto), ou aucun participant ne peut collaborer avec de meilleurs partenaires 
que ses partenaires courants. 

En analysant leur representation, nous montrons que tout systeme de preferences acycliques peut etre re- 
presents par une matrice de notes symetriques. Nous obtenons ainsi une methode pour fusionner des systemes 
de preferences acycliques en conservant la propriete d'acyclicite. Nous etudions egalement des proprietes du 
graphe de collaboration de la configuration stable, comme le diametre ou le coefficient de clustering. Sur un 
example reel de preferences basees sur des mesures de latences, nous observons que la configuration stable a des 
caracteristiques petit-monde. 

Mots-cles : pair-a-pair, mariages stables, choix rationnels, systemes collaboratifs, BitTorrent, reseaux overlay, 
couplages, graphes 
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1 Introduction 

Motivation In most current peer-to-peer (P2P) solutions participants are encouraged to cooperate with each 
other. Since collaborations may be costly in terms of network resources (connection establishment, resource 
consumption, maintenance), the number of connections is often bounded by the protocol. This constraint 
encourages the clients to make a careful choice among others to obtain a good performance from the system. 
The possibility to choose a better partner implies that there exists a preference system, which describes the 
interests of each peer. 

The study of such preference systems is the subject of 6-matching theory. It has started forty-five years 
ago with the seminal work of Gale and Shapley on stable marriages [5]. Although the original paper had a 
certain recreational mathematics flavor, the model turned out to be especially valuable both in theory and 
practice. Today, 6-matching's applications are not limited to dating agencies, but include college admissions, 
roommates attributions, assignment of graduating medical students to their first hospital appointments, or 
kidney exchanges programs El H fTl] . The goal of the present paper is to expand ^-matching application 
domain to P2P networks by using it to model the interactions between the clients of such networks. 

Previous work In [10] we covered general aspects of the 6-matching theory application to the dynamics of the 
node interactions. We considered preference systems natural for the P2P paradigm, and showed that most of 
them fall into three categories: global, symmetric, and complementary. We demonstrated that these systems 
share the same property: acyclicity. We proved existence and uniqueness of a stable configuration for acyclic 
preference systems. 

Contribution In this article, we analyze the links between properties of local marks and the preference lists that 
are generated with those marks. We show that all acyclic systems can be created with symmetric marks. We 
provide a method to merge any two acyclic preference systems and retain the acyclic property. And finally our 
simulations show that real latency marks create collaboration graphs with small-worlds properties, in contrast 
with random symmetric or global marks. 

Roadmap In Section [2] we define the global, symmetric, complementary, and acyclic preference systems, and 
provide a formal description of our model. In Section [3] we demonstrate that all acyclic preferences can be 
represented using symmetric preferences. We consider complementary preferences in Section [U and the results 
are extended to any linear combination of global or symmetric systems. Section [5] discusses the properties of a 
stable solution providing an example based on Meridian project measurements [13]. In Section [6] we discuss the 
impact of our results, and Section [7] concludes. 

2 Definition and applications of P2P preference systems 

2.1 Definitions and general modeling assumptions 

We formalize here a 6-matching model for common P2P preference systems. 

Acceptance graph Peers may have a partial knowledge of the network and are not necessarily aware of all 
other participating nodes. Peers may also want to avoid collaboration with certain others. Such criteria are 
represented by an acceptance graph G(V, E). Neighbors of a peer p € V are the nodes that may collaborate 
with p. A configuration C is a subset C C E of the existing collaborations at a given time. 

Marks We assume peers use some real marks (like latency, bandwidth,. . . ) to rank their neighbors. This is 
represented by a valued matrix of marks m = {m(i, j)}. A peer p uses m(p, i) and m(p, j) to compare i and j. 
Without loss of generality, we assume that is the best mark and m(p, i) < m(p, j) if and only if p prefers i to j. 
If p is not a neighbor of q, then m{p 1 q) = oo. We assume for convenience a peer p has a different mark for each of 
its neighbors. It implies that a peer can always compare two neighbors and decide which one suits better to him. 

Preference system A mark matrix M creates an instance L of a preference system. L(p) is a preference list that 
indicates how a peer p ranks its neighbors. The relation when p prefers qi to qi is denoted by L(p, q{) < L(p, (fc)- 
Note that different mark matrices can produce the same preference system. 
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Global preferences A preference system is global if it can be deduced from global marks (m(i,p) = m(j,p) = 
m(p)). 

Symmetric preferences A preferences system is symmetric if it can be deduced from symmetric marks 
(m(i,j) = m(j,i) for all 

Complementary preferences A preferences system is complementary if it can be deduced from marks of the 
form m(i,j) = v(j)— c(i,j), where v(j) values the resources possessed by j and c(i,j) the resources that i and 
j have in common_j. 

Acyclic preferences A preferences system is acyclic if it contains no preference cycle. A preference cycle is a 
cycle of at least three peers such that eacu peer strictly prefers its successor to its predecessor along the cycle. 

Quotas Each peer p has a quota b(p) (possibly infinite) on the number of links it can support. A b-matching is 
a configuration C that respects the quotas. If the quotas are greater than the number of possible collaborations, 
then simply C = E would be an optimal solution for all. 

Blocking pairs We assume that the nodes aim to improve their situation, i.e. to link to most preferred neighbors. 
A pair of neighbors p and q is a blocking pair of a configuration C if {p, q} <E E \ C and both prefer to change 
the configuration C and to link with the each other. We assume that system evolves by discrete steps. At any 
step two nodes can be linked together if and only if they form a blocking pair. Those nodes may drop their 
worst performing links to stay within their quotas. A configuration C is stable if no blocking pairs exist. 

Loving pair Peers p, q form a loving pair if p prefers q to all its other neighbors and q, in its turn, prefers p to 
all other neighbors. It implies a strong link which cannot be destroyed in the given preference system. 

2.2 Preference systems and application design 

Depending on the P2P application, several important criterion can be used by a node to choose its collaborators. 
We introduce the following three types as representative of most situations: 

Proximity: distances in the physical network, in a virtual space or similarities according to some characteristics, 
Capacity related: network bandwidth, computing capacity, storage capacity, 

Distinction: complementary character of resources owned by different peers. 

Notice that theses types correspond respectively to the definitions of symmetric, global and complementary 
preference system categories. 

Examples of symmetric preferences are P2P applications which optimize latencies. A classical approach 
for distributed hash-table lists of contacts is selecting the contacts with the smallest round trip time (RTT) 
in the physical network. In Pastry [15], a node will always prefer contacts with smallest RTT among all the 
contacts that can fit into a given routing table entry. More generally, building a low latency overlay network 
with bounded degree requires to select neighbors with small RTTs. Optimizing latencies between players can 
also be crucial, for instance, for online real-time gaming applications [H]. Such preferences are symmetric since 
the mark a peer p gives to some peer q is the same as the mark q gives to p (the RTT between p and q). 

Similarly, massively multiplayer online games (MMOG) require connecting players with nearby coordinates 
in a virtual space [51 [8]. Again this can be modeled by symmetric preferences based on the distance in the virtual 
space. Some authors also propose to connect participants of a file sharing system according to the similarity of 
their interests [21 [16], which is also a symmetric relation. 

BitTorrent [2] is an example of a P2P application that uses a capacity related preference system. In brief, 
a BitTorrent peer uploads to peers it has most downloaded from during the last ten seconds. This is an 
implementation of the well known Tit-for-Tat strategy. The mark of a peer can thus be seen has its upload 
capacity divided by its collaboration quota. 

This global preference nature of BitTorrent should be tempered by the fact that only peers with complemen- 
tary parts of the file are selected. Pushing forward this requirement would lead to another selection criterion for 
BitTorrent: preference for the peers possessing the most complementary set of file pieces. In other words, each 

'Of course, in this case the preferred neighbor has a larger mark. 
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peer should try to exchange with peers possessing a large number of blocks it needs. We call this a complemen- 
tary preference system. Note, that this kind of preferences changes continuously as new pieces are downloaded. 
However, the peers with the most complementary set of blocks are those, who enable longest exchange sessions. 

In its more general form, the selection of partners for cooperative file download can be seen as a mix of 
several global, symmetric, and complementary preference systems. 

3 Acyclic preferences equivalence 

In [10], we showed that global, symmetric and complementary preferences are acyclic, and that any acyclic 
6-matching preference instance has a unique stable configuration. And acyclic systems always converge toward 
their stable configuration. However, since acyclicity is not defined by construction, one can ask if other kind of 
acyclic preferences exist. This section is devoted to answer this question. 

Theorem 1. Let P be a set of n peers, A be the set of all possible acyclic preference instances on P, S be the 
set of all possible symmetric preference instances on P, Q be the set of all possible global preference instances, 
then 

Q^A = S 

The rest of this section consists of the proof of theorem[TJ we will first show S C A and Q C A, then A C S, 
which will be followed by Q ^ A. 

Lemma 1. Global and symmetric preference systems are acyclic 

Proof, from JWj Let us assume the contrary, and assume that there is a circular list of peers p\ , . . . , pk (with 
k > 3), such that each peer of the list strictly prefers its successor to its predecessor. Written in the form of 
marks it means that m(jpi,pi+i) < m(pi,pi-i) for all i modulo k. Taking a sum for all possible i, we get 

k k 

'^2m(p l ,p l+1 ) < ^m(pj,pi-i). 

If marks are global, this can be rewritten Xw=i m (Pi) < Si=i m {Pi)i an d if they are symmetric, Yli=i m (Pi>Pi+i) < 
2j=i m(pi,pi + i). Both are impossible, thus global and symmetric marks create acyclic instances. □ 

□ 

The next part, A C S, uses the loving pairs described in 12.11 We first prove the existence of loving pairs in 
Lemma [2l 

Lemma 2. A nontrivial acyclic preference instance always admits at least one loving pair. 

Proof. A formal proof was presented in [10]. In short, if there is no loving pair, one can construct a preference 
cycle by considering a sequence of first choices of peers. □ 



Algorithm 1: Construction of a symmetric note matrix m given an acyclic preferences instance L on n 
peers 
N ~ 

for all p and q, m(j>, q) = +oo (by default, peers do not accept each other) 
while there exists a loving pair {a, b} do 
m(a, b) := m(b, a) := N 

Remove a from the preference list L{b) and b from L(a) 
N := N + 1 



Lemma 3. Let L be a preference instance. Algorithm^ constructs a symmetric mark matrix in 0{n 2 ) time 
that produces L. 
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Proof. The matrix output is clearly symmetric. Neighboring peers get finite marks, while others have infinite 
marks. If an instance contains a loving pair {a, b} then m(a,b) = m(b,a) can be the best mark since a and b 
mutually prefer to any other peers. According to Lemma [2] such a loving pair always exists in acyclic case. By 
removing the peers a and b from their preferences lists, we are lead to a smaller acyclic instance with the same 
preference lists except a and b are now unacceptable to each other. The process continues until all preference 
lists are eventually empty. The marks are given in increasing order, therefore when m(j>,q) and m(p,r) are 
finite, m(p, q) < m(p, r) iff the loving pair {p, q} is formed before the loving pair {p, r} iff p prefers q to r. 

The algorithm runs in 0(n 2 ) time because an iteration of the while loop takes O(l) time. A loving pair can 
especially be found in constant time by maintaining a list on all loving pairs. The list is updated in constant 
time since, after a and b became mutually unacceptable, each new loving pair contains either a and its new first 
choice, or b and its new first choice. □ □ 

All acyclic preferences are not global preferences. A simple counter-example uses 4 peers pi, P2, P3 and P4 
with the following preference lists: 

L(Pi) ■ P2,P3,P4 L(p 2 ) : Pi,P3,P4 L(p 3 ) : p 4 ,pi,p 2 L(p A ) : Ps,Pi,P2 

L is acyclic, but p\ prefers p 2 to p% whereas p^ prefers P3 to pi. p\ and P4 rate p 2 and p% differently, thus 
the instance is not global. 



4 Complementary and Composite Preference Systems 

Complementary preferences appear in systems where peers are equally interested in the resources they do 
not have yet. As said in Section 12.11 complementary preferences can be deduced from marks of the form 
m (p, q) = v (q) — C (P, q) ( m this case, marks of higher values are preferred). 

The expression of a complementary mark matrix m shows it is a linear combination of previously discussed 
global and symmetric mark matrices: m = v — c, where v defines a global preference system and c defines a 
symmetric system. 

Theorem [2] shows that complementary marks, and more generally any linear combination of global or sym- 
metric marks, produce acyclic preferences. 

Theorem 2. Let mi and iri2 be global or symmetric marks. Any linear combination of Ami + M?ri2 is acyclic. 

Proof. The proof is practically the same as for Lemma [TJ Let us suppose that the preference system induced 
by m = Ami + [irn-i contains a preference cycle pi,P2, ■ ■ ■ ,Pk,Pk+i = Pi, for k > 3. We assume wlog that mi is 
global, m 2 symmetric and that marks of higher values are preferred for m. Then m(pi,pi+i) > m{p.i,pi-i) for 
all i modulo k. Taking a sum over all possible i, we get 



^m(pi,pi + i) >^m(p l ,p i ^ 1 ) = ^ (Pi,Pi-i) + Vm 2 (pi,Pi-i)) , but 

i—l i—1 i—l 

(\mi(pi,pj-i) + p,m 2 (pi,Pi-i)) = Ayjmi(pj_i) +^yjm 2 (pi-i,Pi) 

i—l i—l i—l 

k k 



\} j m 1 (p i+1 ) + (j,y]m2(pi,Pi+i) 

i=l i=l 
k 

^2m(pi,p i+1 ). 



i=l 



This contradiction proves the Theorem. □ □ 

Theorem [2] leads to the question whether any linear combination of acyclic preferences expressed by any 
kind of marks is also acyclic. The example bellow illustrates that in general it is not true: 

/ 3 1 \ / 1 2 \ 

Mi = 2 1 , M 2 = 1 3 , Mi + Mi = 
\ 3 1 / \ 1 2 / 

The preference instance induced by Mi + M2 has the cycle 1,2,3, while both Mi and M2 are acyclic (both 
produce global preferences). 
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Note, that a linear combination of two preference system matrices can give duplicates in the marks of a 
single node, which generates ties in preferences. Ties affect existence and uniqueness of a stable configuration, 
depending on how they are handled. If a peer prefers a new node to a current collaborator that has the same 
mark, existence is not guaranteed (but if a stable configuration exists, it is unique). If not, existence stands, 
but not uniquenesfl 

However, Theorem [2] provides in conjunction with Theorem[T]a way of constructing a tie-less acyclic instance 
that can take into account several parameters of the network, as long as all produce acyclic preferences. The 
parameters can be converted into integer symmetric marks using Algorithm [TJ A linear combination using 
Q-independent scalars produces distinct acyclic marks. 

5 Graph properties of stable configurations 

Many protocols use preference systems that come from global and symmetric marks. Studying the properties 
of the stable configuration for such protocols may give information on the performances one can expect. In 
this Section, we study connectivity properties for three cases. Connectivity is extensively studied since Watts 
survey [17] on the small world graphs. These graphs are known to have good routing and robustness properties. 
They are characterized by a small (i.e. 0(log(n)) mean distances and high (i.e. O(l)) clustering. The clustering 
coefficient is the probability for two vertices x and y to be linked, giving that x and y have at least one common 
neighbor. 

The cases we considered all involved a set of n = 2500 peers, and differ from the marks: the first uses a 
global mark matrbJIjthe second a random symmetric mark matrix, and the last a latency mark matrix from the 
Meridian Project |13| . 




(a) Diameter (b) Clustering coefficient 



Figure 1: Diameter and clustering coefficient of latency, random symmetric and global marks (2500 nodes) 
stable configurations. Global marks use an underlying Erdos-Renyi £(2500,0.5) acceptance graph. 

Figure Q] shows the properties of the stable configuration for these three marks, as a function of the quota b 
on the number of links per peer. 

Global marks produce configuration with disconnected cliques of size 6+1 (maximal clustering, and infinite 
diameter). We had previously observed this clusterization effect in [4]. It can be lessened by using an Erdos- 
Renyi acceptance graph. Then the configuration still has a high clustering coefficient, and a high, but finite 
diameter (same order of magnitude than f). This is due to a stratification effect: peers only link to peers that 
have marks similar to them [3]. 

Random symmetric matrix produces configurations with low diameter and clustering coefficient. There 
characteristics are similar to those of Erdos-Renyi graphs. 

2 Irving and Manlove have performed a rather complete study on ties [7]. 
3 In absence of tie, all global marks are the same up to permutation 
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Real latencies produce both a low diameter and a high clustering coefficient. This indicates that the stable 
configuration has small-world structure. But it is not a scale-free network [12], because the degree distribution 
does not follow a power law (the degrees are bounded by b). 

6 Discussion and future work 

Stability Decision, whether a stable configuration is a good thing or not, depends on the characteristics and 
needs of practical applications. If continuous link alteration has a high cost (like in structured P2P networks), 
or if the stable configuration has appealing properties (like the small-world properties observed for latency- 
based stable configuration), then it is interesting to let the system converge. On the other hand, we observed 
that global marks result in a stable configuration with high diameter, which is an undesired feature in most 
cases. Moreover, some systems like gossip protocolspQ take advantage of constant evolution of the corresponding 
acceptance graph. In such cases, the eventual convergence would be harmful. 

Convergence speed The convergence speed is an important characteristic, whether the stable solution is desired 
or not. In the first case, the application is interested in speeding up the process. In the second case, the 
slower possible speed is preferred instead. Although this question is out the scope of the present work, our 
current experiments suggest that the convergence depends on many parameters: the preference system used, 
the acceptance graph, the activity of peers (details of peers' interaction protocol), the quotas and others. If 
we use as time unit the mean interval between two attempts of a given peer to change one of its neighbors, 
then preliminary results show that convergence is logarithmic at best, and polynomial at worst. We plan on 
providing a complete study on the influence of parameters. This should help understanding existing protocols 
and making them more efficient. 

Dynamics of preference systems We have considered fixed acceptance graph and preference lists. In real 
applications, arrivals and departures modify the acceptance graph, along with the discovery of new contacts (a 
toy example is BitTorrent, where a tracker periodically gives new contacts to the clients). The preference system 
itself can evolve in time. For instance, latency can increase if a corresponding link has a congestion problem. 
A complementary preference system is dynamic by itself: as a peer gets resources from a complementary peer, 
the complementarity mark decreases. 

All these changes impact the stable configuration of the system. The question is to know whether the 
convergence speed can sustain the dynamics of preferences or not. Fast convergence and slow changes allow the 
system to continuously adjust (or stay close) to the current stable configuration. Otherwise, the configurations 
of the system may be always far from a stable configuration that changes too often. The preferable behavior 
depends on whether stability is a good feature. This is an interesting direction for the future work. 

7 Conclusion 

In this article, we gave formal definitions for a 6-matching P2P model and analyze the existence of a stable 
configuration with preference systems natural to P2P environment. The term stability in our case corresponds 
to Pareto efficiency of the collaboration network, since the participants have no incentives to change such links. 
We also showed that in contrast with systems based on intrinsic capacities, a latency-based stable configuration 
has small-world characteristics. 
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